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Abstract 

We address the problem of integrating information coming from 
different sources. The information consists of facts that a central 
server collects and tries to combine using (a) a set of logical rules, i.e. 
a logic program, and (b) a hypothesis representing the server's own 
estimates. In such a setting incomplete information from a source or 
contradictory information from different sources necessitate the use of 
many-valued logics in which programs can be evaluated and hypothe- 
ses can be tested. To carry out such activities we propose a formal 
framework based on bilattices such as Belnap's four-valued logics. In 
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this framework we work with the class of programs defined by Fit- 
ting and we develop a theory for information integration. We also 
establish an intuitively appealing connection between our hypothesis 
testing mechanism on the one hand, and the well-founded semantics 
and Kripke-Kleene semantics of Datalog programs with negation, on 
the other hand. 

Keywords : deductive databases and knowledge bases, information 
integration, logics of knowledge, inconsistency, bilattices. 

1 Introduction 

In several information oriented activities there is a need for combining (or 
"integrating") information coming from different sources. 

A typical example of such information-oriented activity is building a data 
warehouse, i.e. a special kind of very large database for decision-making 
support in big enterprises . The information stored in a data warehouse is 
obtained from queries to operational databases internal to the enterprise, and 
from remote information sources external to the enterprise accessed through 
the Internet. The answers to all such queries are then combined (by the 
so-called "integrator") to derive the information to be stored in the data 
warehouse. 

The basic pattern of the data warehouse paradigm, i.e. collection of infor- 
mation then integration, is encountered in many different situations. What 
changes usually from one situation to another is the type (and volume) of 
the collected information and the means used for the integration. 

In this paper we address a specific problem of information integration, 
namely, the information consists of facts that a central server collects from 
a number of autonomous sources and then tries to combine them using: 

• a set of logical rules, i.e. a logic program, and 

• a hypothesis, representing the server's own estimates. 

In such a setting incomplete information from a source or contradictory 
information coming from different sources necessitate the use of many-valued 
logics, in which programs can be evaluated and hypotheses can be tested. Let 
us see a simple example. 
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Example 1 Consider a legal case where a judge (the "central server") has to 
decide whether to charge a person named John accused of murder. To do so, 
the judge first collects facts from two different sources: the public prosecutor 
and the person's lawyer. The judge then combines the collected facts using 
a set of rules in order to reach a decision. For the sake of our example let us 
suppose that the judge has collected a set of facts F that he combines using 
a set of rules R as follows: 



witness (John) false 
friends(John, Ted) true 







R 



suspect(X) ^ mot ive (X ) V witness (X) 

innocent(X) ^ 3y(alibi(X, y) A ^friends(X, F)) 

friends(X, Y) ^ friends(F, X) V (friends(X, Z) A friends(Z, Y)) 

charge(X) ^ suspect (X) © -iinnocent(X) 



The first fact of F says that there is no witness, i.e. the fact witness (Jo/in) 
is false. The second fact of F says that Ted is a friend of John, i.e. the fact 
friends( Jo/in, Ted) is true. 

Turning now to the set of rules, the first rule of R describes how the 
prosecutor works: in order to support the claim that a person X is a suspect, 
the prosecutor tries to provide a motive and/or a witness. 

The second rule of R describes how the lawyer works: in order to support 
the claim that X is innocent, the lawyer tries to provide an alibi for X by a 
person who is not a friend of X. This rule depends on the third rule which 
defines the relation friends. 

Finally, the fourth rule of R is the "decision making rule" and describes 
how the judge works: in order to reach a decision as to whether to charge 
X, the judge examines the premises suspeci{X) and ^innocent{X) . As ex- 
plained earlier, the values of these premises come from two different sources: 
the prosecutor and the lawyer. Each of these premises can have the value 
true or false. However, it is also possible that the value of a premiss is un- 
defined. For example, if a motive is not known and a witness has not been 
found, then the value of suspect (X) will be undefined. 

^That notation means that the only facts which are defined (or assigned to logical 
values different from unknown) are witness(John) which is false and friends(John, Ted) 
which is true. 
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In view of these observations, the question is what value is appropriate 
to associate with charge(X). 

What we propose is to collect together the values of the premises sus- 
pect (X) and -iinnocent(X), and to consider the resulting set of values as the 
value of charge(X). This is precisely what the notation 

charge(X) ^ suspect(X) © -iinnocent(X) 

means, where © denotes the "collecting together" operation. 

It follows that there are four possible values for charge(X) : 0, {true}, 
{false} and {true, false}. We shall call these values : Under defined, True, 
False and Overdefined, and we shall denote them by U, T, T and O, 
respectively. 

The value Underde fined for a premiss means that the premiss is true or 
false but its actual value is currently unknown. For the purpose of this paper 
we shall assume that any premiss whose value is not known is associated with 
the value Underde fined. 

We note that the value Under defined is related to the so-called "null 
values" of attributes in database theory. In database theory, however, a 
distinction is made between two types of null values [Q: 

• the attribute value exists but is currently unknown; 

• the attribute value does not exist. 

An example of the first type is the Department- value for an employee that 
has just been hired but has not yet been assigned to a specific department, 
and an example of the second type is the maiden name of a male employee. 
The value Underde fined corresponds to the first type of null value. 

Returning now to our example, the decision whether to charge John de- 
pends on the value that charge (John) will receive when collecting the values 
of the premises together. Looking at the facts of F and the rules of R (and 
using intuition) we can see that suspect (John) and innocent (John) both re- 
ceive the value lA and so then does charge(John). 

This is clearly where the judge cannot decide whether to actually 

charge John! 

In the context of decision making, however, one has to reach a decision 
(based on the available facts and rules) even if some values are not defined. 
This can be accomplished by assuming values for some or all underdefined 
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premises. Such an assignment of values to underdefined premises is what we 
call a hypothesis. 

Thus in our example, if the judge assumes the innocence of John, then 
charge(John) receives the value false and John is not charged. We note that 
this is precisely what happens in real life under similar circumstances, i.e. 
the defendant is assumed innocent until proved guilty. 

Clearly, when hypothesizing on underdefined premises we would like our 
hypothesis to be "reasonable" in some sense, with respect to the available 
information, i.e., with respect to the given facts and rules. Roughly speaking, 
we define a hypothesis H to be "reasonable" or sound using the following 
test : 



calling a fact / defined under H ii H{f) ^ 14, 

1. add to F to produce a new set of facts F' — F U H; 

2. apply the rules of R to F' to produce a new assignment of values H'; 

3. if the facts defined under H are assigned to the same values in H' then 
H is sound, otherwise H is not sound. 

That is, if there is no fact of H that has changed value as a result of rule 
application then H is a. sound hypothesis; otherwise H is unsound. 
In our example, for instance, consider the following hypothesis: 



innocent (John) T 
charge(John) T 

Applying the above test we find the following values for the facts of Hi : 



innocent (John) T 
charge (John) 

As we can sec, the fact charge(John) has changed value, i.e. this fact had 
the value T in Hi and now has the value ^ in H[. Therefore, Hi is not a 
sound hypothesis. 

Next, consider the following hypothesis: 



H^ 



innocent (John) T 
charge (John) 

Applying again our test we find : 
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innocent (John) T 
charge (John) T 

That is, the values of the facts of H2 remain unchanged in i/g, thus H2 
is a sound hypothesis. 

Intuitively, if our hypothesis is sound this means that what we have as- 
sumed is compatible with the given facts and rules. 

From now on let us denote V the facts of F together with the rules of i?, 
i.e. V = {F, R), and let us call V a program. 

In principle, we may assume or hypothesize values for every possible 
ground atom. However, given a program V and a hypothesis H, we can- 
not expect H to be sound with respect to V, in general. What we can expect 
is that some "part" of H is sound with respect to V. 

More precisely, given two hypotheses H and H', call H a part of H', 
denoted H < H\ if H{f) ^ U implies E{j) = H'{f), i.e., if H agrees with 
H' on every defined fact. It is then natural to ask, given program V and 
hypothesis H, what is the maximal part of H that is sound with respect to 
V. We call this maximal part the support of H by V, and we denote it by 
Sp. Intuitively, the support of H indicates how much of H can be assumed 
safely, i.e., remaining compatible with the facts and rules of V. 

We show that the support can be used to define a hypothesis-based 
semantics of "P = {F,R), denoted by serrip. This is done by a fixpoint 
computation that uses an immediate consequence operator T as follows: 



Fo = F- 



. F,+i = T(F,)©4^,^). 

We also show that there is an interesting connection between hypothesis 
based semantics and the semantics of Datalog programs with negation. More 
precisely, we show that if P is a Datalog program with negation then: 

• if is the everywhere false hypothesis then serrip coincides with the 
well-founded semantics of V and 



if H is the everywhere underdefined hypothesis then sem^ coincides 
with the Kripke-Kleene semantics of "P . 
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As we shall see, these results allow us to extend the well-founded seman- 
tics and the Kripke-Kleene semantics of Datalog program with negation to 
the broader class of Fitting programs . 

Motivation for this work comes from the area of knowledge acquisition, 
where contradictions may occur during the process of collecting knowledge 
from different experts. Indeed, in multi-agent systems, different agents may 
give different answers to the same query. It is then important to be able to 
process the answers so as to extract the maximum of information on which 
the various agents agree, or to detect the items on which the agents give 
conflicting answers. 

Motivation also comes from the area of deductive databases. Updates 
leading to a certain degree of inconsistency should be allowed because in- 
consistency can lead to useful information, especially within the framework 
of distributed databases. In particular, Fuhr and RoUeke showed in that 
hypermedia information retrieval requires the handling of inconsistent infor- 
mation and non-uniform hypotheses. 

The remaining of the paper is organized as follows. In Section 2 we recall 
very briefly some definitions and notations from well-founded semantics, Bel- 
nap's logic J^OUTZ, bilattices and Fitting programs. We then proceed, in Sec- 
tion 3, to define sound hypotheses and their support by a Fitting program V; 
we also discuss computational issues and we present algorithms for comput- 
ing the support of a hypothesis by a program V and the hypothesis-founded 
semantics of V. In Section 4 we show that the notion of support actually 
unifies the notions of well-founded semantics and Kripke-Kleene semantics 
and extends them from Datalog program with negation to the broader class 
of Fitting programs. Section 5 contains concluding remarks and suggestions 
for further research. 



2 Preliminaries 

2.1 Three- valued logics 
2.1.1 Well founded semantics 



Well-founded semantics of logic programs were first proposed in [^. The well 
founded semantics of a Program V is based on the closed world assumption, 
i.e. every atom is supposed to be false by default. In the approach of pOl an 



interpretation / is a set of ground literals that does not contain literals of the 



7 



form A and -^A. Now, if we consider an instantiated program P defined as in 
pO| , its well-founded semantics is defined using the following two operators 



on partial interpretations / : 

• the immediate consequence operator Tp, defined by 

Tp(/) = {head{r) | r G P A VS G body{r), B G /}, and 



• the unfounded operator Up, where Up{I) is defined to be the greatest 
unfounded set with respect to the partial interpretation /. 

We recall that a set of instantiated atoms U is said to be unfounded with 
respect to / if for all instantiated atoms A E U and for all rules r E P the 
following holds: 

header) = A^3B e body{r) {^B e I V B e U) 

In it is proven that Up{I) = HB \ SPFp{I), where KB is the Her- 
brand Base and SPFp{I) is the limit of the increasing sequence [S'PF*(/)]j>i 
defined by: 

- SPF},{I) = {head{r) \ r e P Apos{body{r)) = 

AVS G bodylr),^B ^ 1} 

- SPF'p+\l) = {head{r) \ reP Apos{body{r)) C SPF'pil) 

AVS G bodylr),^B ^l},i>0. 



The atoms of SPFp{I) are called potentially founded atoms. 

The operator Wp, called the well-founded operator, is then defined by 
Wp{I) = Tp{I) U -^Up{I) and is shown to be monotone with respect to set 
inclusion. The well-founded semantics of P is defined to be the least fixpoint 

of Wp m. 



2.1.2 Kripke-Kleene semantics 

The Kripke-Kleene semantics was introduced in 0. In the approach of 
a valuation is a function from the Herbrand base to the set of logical values 
{true, false, unknown}. Now, given an instantiated program V defined 
as in 0, its Kripke-Kleene semantics is defined using an operator $p on 
valuations, defined as follows : given a ground atom A, 
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• if there is a rule in V with head A, and the truth value of the body 
under v is true, then = true; 

• if there is a rule in V with head A, and for every rule in V with head 
A the truth value of the body under v is false, then (^■p{v){A) = false; 

• else ^■p{v){A) = unknown. 

The Kripke-Kleene semantics of a Program V is based on the open world 
assumption, i.e. every atom is supposed to be unknown by default, and 
is defined to be is the iterated fixpoint of $p obtained by beginning the 
iteration with the everywhere unknown valuation. 

2.2 Mult i- valued logics 
2.2.1 Belnap's four- valued logic 

In 0, Belnap defines a logic called J-'OUIZ intended to deal with incomplete 
and inconsistent information. Belnap's logic uses four logical values that we 
shall denote by JF, T, U and O , i.e. J-'OUIZ = {JF, T, U, O}. These values 
can be compared using two orderings, the knowledge ordering and the truth 
ordering. 

In the knowledge ordering, denoted by <fc, the four values are ordered as 
follows: U <fc JF, W <fc T, J-^ <k O, T <k O. Intuitively, according to this 
ordering, each value of J-'OUIZ is seen as a possible knowledge that one can 
have about the truth of a given statement. More precisely, this knowledge 
is expressed as a set of classical truth values that hold for that statement. 
Thus, J-' is seen as {false}, T is seen as {true), U is seen as and O is seen 
as {false,true}. Following this viewpoint, the knowledge ordering is just the 
set inclusion ordering. 

In the truth ordering, denoted by <t, the four logical values are ordered 
as follows: J-^ <t U, J-' <t O, U <t T, O <t T . Intuitively, according to 
this ordering, each value of TOUTZ is seen as the degree of truth of a given 
statement. U and O are both less false than T , and less true than T, but U 
and O are not comparable. 

The two orderings are represented in the double Hasse diagram of Fig- 
ure 1. 

Both <t and <fc give J^OUIZ a lattice structure. Meet and join under the 
truth ordering are denoted by A and V, and they are natural generalizations 
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Figure 1: The logic FOUR 

of the usual notions of conjunction and disjunction. In particular, UA0= T 
and U\/0= T . Under the knowledge ordering, meet and join are denoted by 
® and ©, and are called the consensus and gullibility , respectively: x ® y 
represents the maximal information on which x and y agree, whereas x ® y 
adds the knowledge represented by x to that represented by y. In particular, 
J^®T = U and J^®T = O. 

There is a natural notion of negation in the truth ordering denoted by 
-1, and we have: -i T= J-', -> JF= T, -i U= U, -> 0= O. There is a similar 
notion for the knowledge ordering, called conflation, denoted by -, and we 
have: -U=0,- 0= U, -T=T,- r= T. 

The operations V, A, -i restricted to the values T and T are those of clas- 
sical logic, and if we add to these operations and values the value lA then 
they are those of Kleene's strong three-valued logic. 



2.2.2 Bilattices 



In 1^, |T^, bilattices are used as truth- value spaces for integration of infor- 
mation coming from different sources. The bilattice approach is a basic 
contribution to many-valued logics. Bilattices and their derived sublogics 
are useful in expressing uncertainty and inconsistency in logic programming 
and databases 0, ^, ^ [1^, |T3], |TB|. The simplest non-trivial bilattice is called 



FOUR, and it is basically Belnap's four- valued logic 0. 

Definition 1 A bilattice is a triple {B, <t, <k) , where B is a nonempty set 
and <t, <k are each a partial ordering giving B the structure of a lattice with 
a top and a bottom. 

In a bilattice {B, <t, <k), meet and join under <t are denoted V and A, 
and meet and join under <k are denoted © and ®. Top and bottom under 



10 



<t are denoted T and JF, and top and bottom under <k are denoted T and 
U. If the bilattice is complete with respect to both orderings, infinitary meet 
and join under <t are denoted V and A, and infinitary meet and join under 
<k are denoted and 

Definition 2 A bilattice {B, <t, <k) is called distributive if all 12 distributive 
laws connecting V, A, © and ® hold. It is called infinitely distributive if it 
is a complete bilattice in which all infinitary, as well as finitary, distributive 
laws hold. 

An example of a distributive law is x ® {yM z) = (x ® V (a; ® z). An 
example of an infinitary distributive law is x®V{?/iK ^ S} = \/{x®yi\i G S}. 

Definition 3 A bilattice {B, <t, <k) satisfies the interlacing conditions if 
each of the operations V, A, © and © is monotone with respect to both 
orderings. If the bilattice is complete, it satisfies the infinitary interlacing 
conditions if each of the infinitary meet and join is monotone with respect to 
both orderings. 

An example of an interlacing condition is: Xi <t yi and X2 <t ?/2 implies 
Xi © X2 <t Vi © I/2- An example of an infinitary interlacing condition is: 

<t Ui for all z e implies 0{xi|z G S} <t ®{yi\i G S"}. A distributive 
bilattice satisfies the interlacing conditions. 

J-'OUTZ is an infinitary distributive bilattice which satisfies the infinitary 
interlacing laws. A bilattice is said to be nontrivial if the bilattice FOUR 
can be isomorphically embedded in it. 

There are two principal ways for constructing bilattices that were intro- 



duced in [jT2[ , and then developped in details in p . The first one consists in 
considering two lattices (Li, <i) and {L2, <2)- We can see Li as the set of 
values used for representing the degree of belief (evidence, confidence, etc.) 
of an information and L2 as the set of values used for representing the degree 
of doubt (counter-evidence, lack of confidence, etc.) of the information. 

We define the structure Li © L2 to be the structure (Li x L2,<t, <k) 
where: 

• {x, y) <t {z, w) iS X < z and w < y, 

{{x,y) A {z,w) = {min{x, z), max {y,w))), and 
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• {x, y) <k {z, w) iS X < z and y < w 
{{x,y) {z,w) — {'min{x, z),min{y,w))). 

Li L2 is a bilattice satisfying the interlacing conditions; it is a com- 
plete bilattice satisfying the infinitary interlacing conditions if Li and L2 are 
complete; it is infinitely distributive if Li and L2 are complete and infinitely 
distributive. Moreover, ii L = Li = L2, then a negation can be defined by 

-'{x,y) = {y,x). 

The following example illustrates possible uses of such a bilattice. 

Example 2 Suppose that we have two information sources: two veterinaries 
Vi and V2, and that we want to know the answer to the query: Is Marguerite 
a crazy cow ? 

Ifvi asserts that the probability she is mad is 70%, and V2 asserts that the 
probability she is not mad is 40%, then this knowledge can be represented by 
assigning to the atom Mad(Marguerite) the logical value (0.7,0.4) e [0; 1] x 
[0;!]. 

Such values could also be useful when each source can only answer by true 
or false, but is associated to a specific degree of reliability. The value (0.7, 0.4) 

could then represent the fact that Vi asserts that she is mad whereas V2 asserts 
that she is not mad, but that we are more confident in diagnostics of Vi than 
in those of V2. That difference of reliability or confidence being represented 
by the assignation of different degrees of reliability to information sources, in 
our example, vi would be supposed reliable for 70% and V2 for 40%. 

The second way of constructing a bilattice consists in interpreting values 
as approximations of exact values. Suppose we have a lattice {L, <i) of 
truth values. An approximation of a truth value can be seen as an interval 
[a, — {x \ a <L X <L h} containing that value. We can provide to the set 
of intervals a structure of bilattice {0{L), <<, <k) such that, for [a, 6], [c, d\ e 
0{L): 

• [O't &] <fe [c, d\\i a <L c and d <l b, and 

• [o, b] <t [c, d] if a <L c and b <l d. 

The intuition is that knowledge increases if the interval become shorter 
and truth increases if the interval contains greater values. 
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By abuse of notation we will sometimes talk about the bilattice B when 
the orders are irrelevant or understood from the context. From now on, we 
assume that B is an infinitely distributive bilattice that satisfy the infinitary 
interlacing conditions and has a negation unless explicitly stated otherwise. 

2.2.3 Fitting programs 

Conventional logic programming has the set {JF, T} as its intended space of 
truth values but since not every query may produce an answer partial models 
are often allowed (i.e. U is added). If we want to deal with inconsistency as 
well then O must be added. Thus Fitting asserts that J-'OUTZ can be thought 
as the "home" of ordinary logic programming and extends the notion of logic 
program, as follows: 

Definition 4 (Fitting program) 

• A formula is an expression built up from literals and elements of B, 
using A, V, 0, ©, 3, V. 

• A clause is of the form P{xi, ...,Xn) < — (p{xi, Xn) , where the atomic 
formula P{xi, ...,Xn) is the head, and the formula (f){xi, ...,Xn) is the 
body. It is assumed that the free variables of the body are among 

X\ , . . . , Xfi . 

• A program is a finite set of clauses with no predicate letter appearing 
in the head of more than one clause (this apparent restriction causes 
no loss of generality ^). 

We shall represent a Fitting program as a pair (F, R) where F is a function 
from the Herbrand base into B and R a set of clauses. This is possible because 
every fact can be seen as a rule of the form A ^ where A is an atom and 
w is a value in B. 

A Datalog program with negation can be seen as a Fitting program whose 
underlying truth- value space is the subset {JF, T ,U} oiB and which does not 
involve 0, ©,V,Z^, 0,T. 

3 Hypothesis Testing 

In the remaining of this paper, in order to simplify the presentation, we 
assume that all Fitting programs are instantiated programs. Moreover, we 



13 



use the term "program" to mean "Fitting program" , unless explicitly stated 
otherwise. 



3.1 Interpretations 

First, we introduce some terminology and notation that we shall use through- 
out the paper. Given a program P, call interpretation of V any function / 
over the Herbrand base TCB-p such that, for every atom A of TCB-p, I{A) is a 
value from B. 

Two interpretations / and J are compatible if, for every ground atom A, 
{I (A) ^ U and J {A) ^U)^ I {A) = J (A). 

An interpretation / is a part of an interpretation J, denoted / < J, if 
I{A) ^ U implies I {A) = J{A), for every ground atom A. Clearly, the part-of 
relation just defined is a partial ordering on the set V{B) of all interpretations 
over B. Given an interpretation I, we denote by def{I) the set of all ground 
atoms A such that ^ U. Moreover, if S is any set of ground atoms, we 
define the restriction of I to S, denoted by I/s as follows: for all A e TiB-p, 



I{A) HAeS, 
U, otherwise. 



We can extend the two orderings of B (i.e. the truth ordering and the 
knowledge ordering) to the set V{B) as follows: Let h and I2 be in V(B), 
then 

• /i <t I2 if and only if Ii{A) <t hiA) for all ground atoms A; 

• h <fc h if and only if Ii{A) <k hi^) for all ground atoms A. 

Under these two orderings V{B) becomes a bilattice, and we have 
(/ A J){A) = I (A) A J (A), and similarly for the other operators. V{B) 
is distributive, satisfies the interlacing conditions and has a negation and a 
confiation. 

The operations of B can be extended naturaly to V{B) in the following 
way: / © J{A) = I{A) © J{A^) and similarly for the other operations. 

The actions of interpretations can be extended from atoms to formulas 
as follows: 

• I{X AY) — I{X) A I{Y), and similarly for the other operators. 
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• H{^x)4>{x)) = yt=closedtermH4>{t)), and 

=closedterm 

If 5 is a closed formula then we say that B evaluates to the logical value 
a, with respect to an interpretation /, denoted hy B = a w.r.t. / or by 
B =1 a, if J{B) = a for any interpretation J such that I < J (i.e. if the 
value of B is equal to a with respect to the defined atoms of / whatever 
the values of undcrdcfined atoms could be). There are formulas B in which 
underdcfincd atoms do not matter for the logical value that can be associated 
with B. For example let us take B = A\J C and let the interpretation / be 
defined by I{A) = U, I{C) = T; then no matter how A is interpreted B is 
evaluated to T, that is, B =/ T. 

Given an interpretation /, let lo be the interpretation defined by : if 
I{A) ^ U then /ci(A) = 1{A) else /ci(A) = C, for every atom A. Using the 
interlacing conditions, we have the following lemma that provides a method 
of testing whether B =i a, based on the interpretation Iq. 

Lemma 1 Given a dosed formula B, B =i a iff I{B) — a and Io{B) — a. 
3.2 The Support of a Hypothesis 

Given a program P = {F, R), we consider two ways of inferring information 
from V. First by activating the rules of R in order to derive new facts from 
those of F, through an immediate consequence operator T. Second, by a 
kind of default reasoning based on a given hypothesis. 

Definition 5 (immediate consequence operator T) The immediate con- 
sequence operator T takes as input the facts of F and returns an interpreta- 
tion T{F), defined as follows: for all ground atoms A, 

• if there is a rule A <— B & R, then Tr{F){A) = a if B =p a, 

• Tr{F){A) ^U, otherwise. 

What we call a hypothesis is actually just an interpretation H. However, 
we use the term "hypothesis" to stress the fact that the values assigned by 
H to the atoms of the Herbrand base are assumed values - and not values 
that have been computed using the facts and rules of the program. As such, 
a hypothesis H must be tested against the "sure" knowledge provided by V. 
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The test consists of "adding" H to F, then activating the rules of V (using 
T) to derive an interpretation H'. If H < H', then the hypothesis if is a 
sound one, i.e. the values defined by H are not in contradiction with those 
defined by V. Hence the following definition: 

Definition 6 (Sound Hypothesis) Let V = {F, R) be a program and H a 
hypothesis. H is sound w.r.t. V if 

• F and H are compatible, and 

• H/Head{p) < T{F © H), where HeadiV) = {A\ 3A^BeV}. 

We use the restriction of H to Head{V) before making the comparison 
with T{F(BH) because all atoms which are not head of any rule of V will be 
assigned to the value Under defined by T{F © H). Then H and T{F © H) 
are compatible on these atoms. 

The following example illustrates the definition of sound hypothesis with 
respect to a logic program. 

Example 3 We consider the program V = (JF, TZ) defined by : 

witness (Jean) T Q 



R 



suspect(X) <— motive{X) V witness{X) 

innocent{X) <- JY {alibi{X ,Y) ^ ^friends{X ,Y)) 

friends{X, Y) ^ friends{Y, X) V {friends{X , Z) A friends{Z, Y)) 

charge{X) <— suspect{X) © -^innocent{X) 



Let H be the following hypothesis : 

witness (J can) T 

^_ motive(Jean) T 

suspect(Jean) T 

innocent(Jean) T 



^That notation means that the only atom which is assigned to a logical value different 
from in is "witness(Jean) and that its value is T. 
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We can easily note that H is not sound with respect to V. The atom 
witness (Jean) is defined in H and in F, hut with different values, so H and 
F are not compatible. 

The maximal part of H that is compatible with F is 



H' 



motive(Jean) T 
suspect (Jean) T 
innocent (Jean) T 



F and H' are compatible, so it is possible to collect the knowledge de- 
fined by these two interpretation in a new one without creating conflicts or 
inconsistencies. 



witness (J can) T 

motive (J can) T 

suspect(Jean) T 

innocent(Jean) T 



Then we activate the rules of R on the interpretation F (B H' 



Tr{F®H') = 



witness(Jean) T 

motive(Jean) T 

suspect (Jean) T 

charge(Jean) T 



We observe that H' is not sound with respect to V because H' is not a 
part ofTji{F ® H) and is in contradiction with the derived knowledge. 

Even if a hypothesis H is not sound w.r.t. P, it may be that some part 
of H is sound w.r.t. V. Of course, we are interested to know what is the 
maximal part of H that is sound w.r.t. V. We shall call this maximal part 
the "support" of H . To see that the maximal part of H is unique (and thus 
that the support is a well-defined concept), we give the following lemma: 

Lemma 2 If Hi and H2 are two sound parts of H w.r.t. V, then Hi © H2 
is sound w.r.t. V. 

Proof. Hi and if 2 are both restrictions of H so they are compatible. 
Moreover, Hi and H2 are two sound parts of H with respect to P, so Hi and 



17 



are both compatible with F. It follows that Hi © H2 is compatible with 

F. 

We also have -^l/Heads(7^) — -^(-^ ® -^i)' atom ^4 head of a 

rule A ^ 5 G P, if 7^ W then Hi{A) = T{F © The same 

property is verified by H2. If Hi © H2{A) ^ U then we have : 

• either Hi{A) ^ U and //2(A) = W, 

• either i/al^) 7^ U and = W, 

• either Hi{A) = H2{A)^U. 

In the first case, we have T{F © Hi){A) = Hi{A) = Hi ® //2(A), i-e. 
B = Hi® H2{A) p.r. F®Hi,soB = Hi® H2{A) w.r.t. F®Hi® H2. We 
have the same result in the two other cases, so 

(^i®^2)/Heads(7e)<^(^®^i®^2). 

□ 

Thus the maximal sound part of H is defined hy ®{H' \ H' < H and H' 
is sound w.r.t. V}. 

Definition 7 (Support) Let V he a program and H a hypothesis. The 
support of H w.r.t. V , denoted , is the maximal sound part of H w.r.t. V 
(where maximality is understood w.r.t. the part- of ordering <). 



Example 4 Let V be the program and H the hypothesis defined in the ex- 
ample then the support of H with respect to V is : 



H 



motive(Jean) T 



We can remark that the support of a hypothesis with respect to a program 
V = {R, F) is compatible with the interpretation obtained by activating the 
rules of R on the facts of F. 

Lemma 3 Let V = {R, F) be a logic program and H a hypothesis. Tr(F) 
and Sp are compatible. 
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Proof. For all atom A, if A is not the head of any rule of R, then 
Tr{F){A) < s^{A). If there is in a rule forme A -s— then : 

• \is^{A) = H{A) ^ U, then B = a w.r.t. F®s^, and if Tr(F)(A) ^ U, 
then Tr{F){A) = H{A) ; 

• if rR(F)(A) = a^U, then S = a w.ri. F and S = a w.r.t. F © s^, 
and if s^{A) ^ U, then (A) = a. 

□ 

We now give an algorithm for computing the support of a hypothesis 
H w.r.t. a program V. 

Consider the following sequence {PF^), z > 0: 

• PF^ = 0; 

• PFi = {A I A ^ S e P and S ^ t/;.r.t F®H /ijiBr\iF{F,H))\PF,.,} 
for all i > 0, 

where IF{F, H) is teh set of facts that are imcompatible with H, de- 
fined by IF{F, H)^{A\ {F{A) ^ U)a{H{A) ^ U)a{F{A) ^ H{A))}. 

The intuition here is that we want to evaluate step by step the atoms 
that could potentially have a logical value different than their values in H. 
We have the following results: 

Proposition 1 The sequence (PFi), i > is increasing with respect to set 
inclusion and it has a limit reached in a finite number of steps. This limit is 
denoted PF. 

Proof. We show by recurrence that for all n, PFn-i C PF„. 
PFq = so the property is satisfied for n = 1. 

Suppose that PFi_i C PF^. Thus we have 

H/(HB-p\IF{F,H))\PFi < H/^-HBr\IF{F,H))\PFi_i- 

For all atom A, if there is in a rule A ^ B, then if B ^ H(A) 
with respect to F © H /{HB-p\iF{F,H))\PFi^i, then B ^ H{A) with respect to 
F© H/(^Br\iF{F,H))\PFi, and consequently, PFi ^ {A \ A ^ B e V and 
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B ^ H{A) with respect to F ® H/(nBv\iF{F,H))\PFi_A C {A \ A ^ B e V 
and B ^ H{A) with respect to F ® H/(nBr\iF{F,H))\PFi} = P^i+i- 

□ 

If an atom of the Herbrand base is not in PF, then it means that, with 
respect to V, there is no way of inferring for that atom a logical value different 
than its value in H w. 

Theorem 1 Let V a logic program and H a hypothesis, we have 

= H n^Bv\IF{F,H))\PF 

Proof. We note X — H /(y^i3^\iF{F,H))\PF- Firstly, we show that X is a 
sound part of H with respect to V. By definition, X and F arc compatible. 
Let A be an atom such that there exists in i? a rule A ^ B and such that 
X{A) = H{A) U. Then A ^ PF and 5 = H{A) p.r. F ® X, so X is 
sound. 

Secondly, we prove that X is the maximal sound part of H with respect 
to V. Let y be a sound part of H. We show by recurrence that for all atom 
A, if Y{A) = H{A) ^ U, then A ^ PF^. 

If A is not the head of any rule in R, then A ^ PFi, for all i. PFq = so 
A ^ PFq. Suppose the property satisfied ior n — i — We have, for all atom 
A, if Y{A) = H{A) ^ U, then A ^ PFi_i, and Y < H/^nBr\mF,H))\PF,.,. 

If there is a rule A ^ 5 in i? and Y{A) = H{A) ^ U, then 
B = H{A) w.r.t. F (BY because Y is sound, and it follows that 
B = H{A) w.r.t. F © H/(^}iB-p\iF{F,H))\PF,^i- We can conclude that A ^ PF^. 

We have shown that for all atom A, if Y{A) = H{A) ^ U, then A ^ PF 
and consequently, X{A) — H{A). 

For all sound part y of if, we have Y < X. 

□ 



4 Hypothesis Founded Semantics 

As we explained earlier, given a program P — {F,R), we derive informa- 
tion in two ways: by activating the rules (i.e. by applying the immediate 
consequence operator T) and by making a hypothesis H and computing its 
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support w.r.t. V. In the whole, the information that we derive comes 
from T(F) © s^. 

Now, roughly speaking, the semantics that we would like to associate with 
a program V is the maximum of information that we can derive from V under 
a sound hypothesis but without any other information. To implement this 
idea we proceed as follows: 

1. As we don't want any extra information (other than V and s^), we use 
the everywhere undefined interpretation, call it lu- 

2. In order to actually derive the maximum of information from V and In, 
we collect together the knowledge infered by activating the rules of R, 
i.e. by applying the operator Tr, and as much of assumed knowledge 
as possible, i.e. the support of H w.r.t. V. 

Proposition 2 The sequence (Fn), n>0 defined by : 

• Fq = F , and 

is increasing with respect to < and has a limit denoted by sem^. 

Proof. It is straighforward that is monotonic with respect to <, so for 
all n, Tr{F^)<Tr{F^+,). 

We prove by recurrence that for all n, -s^^^^ < s^p^^_^^j^y 

For n = 0, if {A) = a^U then : 

• if A is not the head of any rule in i?, then s^p^^^^A) — a ; 

• if there is a rule A <— B in i?, then B = a w.r.t. ^(f^r), 
so B = q; w.r.t. s^p j^^ ® Tji{F) and B = a w.r.t. F^. 

It follows that B = a w.r.t. Fi ® s^p^ j^y Consequently, s^p^ = a. 

The property is true for n = 0. 

Now, suppose that < sfp^ j^y If sfp^ j^^iA) ^aj^U, then 

• if v4 is not the head of any rule in R, then sfp ^^ {A) = a ; 
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• if there is a rule A ^ B in R, then B = a w.r.t. F^-i © ^fpn-r r)- 

For all n, Fn®sfp^^^^ < TniFn) ® sfp^j^y Indeed, if Fn® sfp^^^j^^iA) = a^U, 
then : 

• T^^Fn-ij^A) = a, and it follows that Tr(F„)(A) = a, 
because rR(F„_i) <Tii{Fn) ; or 

• J?>(^) ~ ^^"^ follows that s^p^ _R,)(^) = ct) 
because < ; or 

Consequently, B = a w.r.t. Tn^Fn-i) © i.e. B = a w.r.t. Fn, thus 

i? = a ty.r.t. F„ © s|^^ Finally = Q^- 

So we have for all n, Tn^Fn) < Tn^Fn+i) and sfp^j^^ < sfp^^^^^^ following 
the lemma ^ we can conclude that for all n, F„ < -Fn+i- 

□ 



Proposition 3 The interpretation serrip is a model ofV. 

Proof. serup = Tji{semf^) © s^^^h j^y But serup is a sound hypothesis, 
so, by definition, we have 

S(se<,i?>/Heads(R) " ^^^^^^^ ® ^(sem'^fi))- 

It follows that 
i.e. 

Thus, for all rule A <^ B E R, semp{A) = Tji{sem!p){A) = a if B = a 
with respect to serrip, so serrip is a model of P. 

□ 

This justifies the following definition of semantics for V. 
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Definition 8 (Hypothesis founded semantics of V) The interpretation 
serrip is defined to be the semantics ofV w.r.t. H or the H -founded seman- 
tics ofV. 

Following this definition, any given program V can be associated with 
different semantics, one for each possible hypothesis H . Theorem 2 below 
asserts that this approach extends the usual semantics of Datalog programs 
with negation to a broader class of programs, namely the Fitting programs. 

Two remarks are in order here before stating Theorem 2. First, if we 
restrict our attention to three values only, i.e. JF, T and W, then our definition 
of interpretation is equivalent to the one used by Van Gelder et als in 
the following sense: given an interpretation / following our definition, the 
set {A I I{A) = T} U {~'A \ I{A) = JF} is a partial interpretation following 



20| ; conversely, given a partial interpretation J following |20], the function 



/ defined by: I{A) = T if A G J, /(A) = if G J, and I{A) = U 
otherwise, is an interpretation in our sense. 

Second, if we restrict our attention to Datalog programs with negation 
(recall that the class of Fitting programs strictly contains the Datalog pro- 
grams with negation) then the concept of sound interpretation for the ev- 
erywhere false hypothesis reduces to that of unfounded set of Van Gelder et 
als |2y]. The difference is that the definition in pO| has rather a syntactic 



fiavor, while ours has a semantic fiavor. Moreover, our definition not only 
extends the concept of unfounded set to multi-valued logic, but also gener- 
alizes its definition to any given hypothesis H (not just the everywhere false 
hypothesis) . 

Theorem 2 Let V he a Datalog programs with negation. 

1. If Hjr is the everywhere false hypothesis, then sem^^ coincides with 
the well-founded semantics ofV; 

2. If Hu is the everywhere underdefined hypothesis, then serup^ coincides 
with the Kripke-Kleene semantics ofV. 

Proof. A Datalog programs with negation can be seen, in our approach, 
as a set of rules of the form 

A ^ (Li,i A ... A Li,„) V ... V (L,,i A ... A L,,„), 
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where the Lp ^ are htterals. 

First part. Let wfs{V) be the well-founded semantics of V. We use in 
this part the definition of wfs{V) given in section |2.1.1| , considering all the 
interpretations as functions from HB-p in {J^,U,T}. 

Firstly, we show that wfs{V) = TR{wfs{V)) © sj^^(p)^^^. 

We know that wfs{V){A) = T if and only if there is in P a rule 

A ^ A ... A Li,„) V ... V A ... A L^,™), 

such that there is {Lp i A ... A Lp g) G {(-^1,1 A ... A Li n); A ... A Li ^)} 

such that wf s{V){Lpi) = ... = wfs{V){Lpq) = T, i.e. if and only if 
TniwfsiV))iA)=T/ 

We know that wfs{V){A) = JF if and only if : 

• either A is not the head of any rule of R, and then s^^^^p^ {A) = T ; 

• either there is in P a rule A ^ (Lij A ... A I/i,n) V ... V (Lj 1 A ... A Li^m), 
such that for all (Lp^i A... ALp^^) G {(Li^i A... ALi^„); (Lj^i A... ALj^m)} 
there is Lp^fc G {Lp^i, Lp^g} such that wfs{V{Lp^k) = ^, i-e. if and 
onlyifrH(w;/s(P))'(A) = ^. 

Consequently, wfs{V) = Tji{wfs{V)) © s^wfs{v) b)- follows that 

sem^^ < wfsiV). 



We prove now than sem-p^ < T^{semp^) U W-p(semp-^). 

We know that sem!^^{A) = T if and only if there is in P a rule 

A ^ A ... A Li,„) V ... V A ... A L^,^), 

such that A... ALi^„)V...V(Lj_i A... ALj^m)) = T w.r.t. sem^^ , i.e. such 

that there is (Lp 1 A... ALp^^) G {(Li^i A... ALi „); (Lj^iA... ALj such that 
sem^^ (Lp i) = ... = sem^^ (Lpq) = T, i.e. if and only if T-p{semp^){A) = 
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r. 



We know that sem-p^{A) = T ii and only if : 

• either A is not the head of any rule of R, and then U-p(sem^^)(A) — T\ 

• either there is in a rule A -s— (Li^i A ... ALi_„) V ... V (Lj^i A ... A Li,^,)) 
such that ((Li^i A ... A Li^„) V ... V (Lj^i A ... A Li,^)) = T w.r.t. sem^^ , 
i.e. for all (Lp,i A ... A Lp^q) e {(Li,i A ... A Li,„); (L^,! A ... A Li,„)} 
there is Lpfc G ...,Lp,j} such that sem^^{Lp^k) = i-e. if and 
only a U'p{semp^){A) = T. 

Consequently, senvp" < T^{semp^) iJUp{sem^^), thus wfs{V) < sem^^. 

Second part. Let I be an interpretation, we show that Tji{I) — where 
$p is the Kripke-Kleene operator. 

Then ^'p{I)[A) = a y^U if and only if there is in P a rule A ^ B, where 
B is defined by (Li^i A ... A Li,„) V ... V (Lj^i A ... A Li,^), and if : 

• either there is (Lj_i A ... A Lj^k) Q B such that Lj^i — ... = Lj^k = 

• either for all {Lj^i A ... A Lj^k) Q B, there is L^y e {-^j.i; such 
that ^ = 

Thus $-p(J)(74) = q; 7^ W if and only if there is in a rule A B such that 
B =1 a, i.e. if and only if Tr{I){A) = a^l(. 

As we consider the hypothesis Hu, we will infer information only with Tr. 
sernS,^ is the least fixpoint of Tr with respect to the knowledge ordering, thus 
sem-p^ coincides with the Kripke-Kleene semantics of V which is the least 
fixpoint of $-p with respect to the knowledge ordering. 

□ 



5 Concluding remarks 

We have defined a formal framework for information integration based on 
hypothesis testing. A basic concept of this framework is the support provided 
by a program V — {F, R) to a hypothesis H. The support of H is the maximal 



25 



part of H that does not contradict the facts of F or the facts derived from 
F using the rules of R. 

We have then used the concept of support to define hypothesis-based 
semantics for the class of Fitting programs, and we have given an algorithm 
for computing these semantics. 

Finally, we have shown that our semantics extends the well-founded se- 
mantics and the Kripke-Kleene semantics to multi-valued logics with bilat- 
tice structure, and also generalizes them in the following sense: if we restrict 
our attention to three-valued logics then for Hjr the everywhere false inter- 
pretation our semantics reduces to the well-founded semantics, and for Hu 
the everywhere under defined interpretation our semantics reduces to the 
Kripke-Kleene semantics. 

We believe that hypothesis-based semantics can be useful not only in the 
context of information integration but also in the context of explanation- 
based systems. Indeed, assume that a given hypothesis H turns out to be 
a part of the if-semantics of a program V. Then V can be seen as an 
"explanation" of the hypothesis H. We are currently investigating several 
aspects of this explanation oriented viewpoint. 
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